SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: MAERTENS, GEERT 
BOSMAN, FONS 
DE MARTYNOFF, GUY 
BUYSE, MARIE-ANGE 

(ii) TITLE OF INVENTION: PURIFIED HEPATITIS C VIRUS ENVELOPE 
PROTEINS FOR DIAGNOSTIC AND THERAPEUTIC USE 



(iii) NUMBER OF SEQUENCES: 111 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: NIXON & VANDERHYE P.C. 

(B) STREET: 1100 NORTH GLEBE ROAD

(C) CITY: ARLINGTON

(D) STATE: VIRGINIA

(E) COUNTRY: U.S.A.

(F) ZIP: 22201-4714

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: US 08/612,973
(B) FILING DATE: 11-MAR-1996

(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: BYRNE, THOMAS E.

(B) REGISTRATION NUMBER: 32,205

(C) REFERENCE/DOCKET NUMBER: 1487-10



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (703) 816-4000 

(B) TELEFAX: (703) 816-4100 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



HCVENV.-T 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GGCATGCAAG CTTAATTAAT T 21 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

ycggggaggc ctgcacgtga tcgagggcag acaccatcac caccatcact aatagttaat 60 
Waactgca 

(2) INFORMATION FOR SEQ ID NO: 3:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..639 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..636 
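
The feature coordinates above can be cross-checked arithmetically. The following is a minimal sketch and not part of the filed listing. It assumes that the mat_peptide span covers sense codons only, that the CDS span adds the first stop codon, and that the sequence ends with a second stop codon (the TAATAG terminator). The function name `check_features` is hypothetical.

```python
def check_features(length_bp, cds_end, mat_end):
    """Return the residue count implied by a mat_peptide span of 1..mat_end.

    Assumptions (illustrative only): the CDS is the mature peptide plus
    one stop codon, and the full sequence is the CDS plus one more stop codon.
    """
    if mat_end % 3 != 0:
        raise ValueError("mat_peptide span is not a whole number of codons")
    if cds_end - mat_end != 3:
        raise ValueError("CDS should extend exactly one codon past mat_peptide")
    if length_bp - cds_end != 3:
        raise ValueError("sequence should extend exactly one codon past the CDS")
    return mat_end // 3


# SEQ ID NO: 3: LENGTH 642, CDS 1..639, mat_peptide 1..636
print(check_features(642, 639, 636))
```

Under these assumptions the result for SEQ ID NO: 3 agrees with the 212 amino acids stated for SEQ ID NO: 4.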



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTA CTG TCC TGT 48
Met Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys
1 5 10 15

CTG ACC ATT CCA GCT TCC GCT TAT GAG GTG CGC AAC GTG TCC GGG ATG 96
Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Met
20 25 30

TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG TAT GAG GCA 144
Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr Glu Ala
35 40 45

GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC GTT CGG GAG 192
Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu
50 55 60

AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG CTC GCA GCT 240
Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala
65 70 75 80

AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC GTC GAT TTG 288
Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His Val Asp Leu
85 90 95

... GTT GGG GCG GCT GCT CTC TGT TCC GCT ATG TAC GTG GGG GAT CTC 336
Leu Val Gly Ala Ala Ala Leu Cys Ser Ala Met Tyr Val Gly Asp Leu
100 105 110

TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC TCG CCT CGC 384
Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Ile Ser Pro Arg
115 120 125

... CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC 432
... His Glu Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His
130 135 140

ATA ACA GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT 480
Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro
145 150 155 160

ACA ACG GCC CTG GTG GTA TCG CAG CTG CTC CGG ATC CCA CAA GCT GTC 528
Thr Thr Ala Leu Val Val Ser Gln Leu Leu Arg Ile Pro Gln Ala Val
165 170 175

GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG GGC CTC GCC 576
Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala
180 185 190

TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG ATT GTG ATG CTA 624
Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Ile Val Met Leu
195 200 205

CTC TTT GCT CTC TAATAG 642
Leu Phe Ala Leu
210

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys
1 5 10 15

Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Met
20 25 30

Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr Glu Ala
35 40 45

Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu
50 55 60

Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala
65 70 75 80

Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His Val Asp Leu
85 90 95

Leu Val Gly Ala Ala Ala Leu Cys Ser Ala Met Tyr Val Gly Asp Leu
100 105 110

Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Ile Ser Pro Arg
115 120 125

... His Glu Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His
130 135 140

Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro
145 150 155 160

Thr Thr Ala Leu Val Val Ser Gln Leu Leu Arg Ile Pro Gln Ala Val
165 170 175

Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala
180 185 190

Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Ile Val Met Leu
195 200 205

Leu Phe Ala Leu
210



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..792

(ix) FEATURE:
(A) NAME/KEY: mat_peptide
(B) LOCATION: 1..789

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48
Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96
Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

... TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336
Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG 480
Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val
145 150 155 160

GGG GAC CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 528
Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Ile
165 170 175

TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 576
Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr
180 185 190

CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624
Pro Gly His Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
195 200 205

TGG TCG CCT ACA ACG GCC CTG GTG GTA TCG CAG CTG CTC CGG ATC CCA 672
Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gln Leu Leu Arg Ile Pro
210 215 220

CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG 720
Gln Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala
225 230 235 240

GGT CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG ATT 768
Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Ile
245 250 255

GTG ATG CTA CTC TTT GCT CCC TAATAG 795
Val Met Leu Leu Phe Ala Pro
260



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 263 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val
145 150 155 160

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Ile
165 170 175

Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr
180 185 190

Pro Gly His Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
195 200 205

Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gln Leu Leu Arg Ile Pro
210 215 220

Gln Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala
225 230 235 240

Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Ile
245 250 255

Val Met Leu Leu Phe Ala Pro
260



(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 633 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 1..630

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..627 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACG TGC GGC TTC GCC GAC CTC 48
Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

ATG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGT GCT GCC AGA 96
Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAA GAC GGC GTG AAC TAT GCA 144
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

ACA GGG AAT TTG CCT GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTA 192
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

CTG TCC TGT CTG ACC ATT CCA GCT TCC GCT TAT GAG GTG CGC AAC GTG 240
Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336
Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACT ACG ACA ATA CGA CGC CAC 432
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG 480
Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val
145 150 155 160

GGG GAT CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 528
Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Ile
165 170 175

TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 576
Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr
180 185 190

CCC GGC CAC ATA ACA GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624
Pro Gly His Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
195 200 205

TGG TAATAG 633
Trp
210

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 209 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val
145 150 155 160

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Ile
165 170 175

Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr
180 185 190

Pro Gly His Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
195 200 205

Trp



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 483 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..480

(ix) FEATURE:

(A) NAME/KEY: mat_peptide
(B) LOCATION: 1..477



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCC CTG CTG TCC TGT 48
Met Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys
1 5 10 15

CTG ACC ATA CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG TCC GGG GTG 96
Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val
20 25 30

TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATA GTG TAT GAG GCA 144
Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr Glu Ala
35 40 45

GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC GTT CGG GAG 192
Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu
50 55 60

GGC AAC TCC TCC CGT TGC TGG GTG GCG CTC ACT CCC ACG CTC GCG GCC 240
Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala
65 70 75 80

AGG AAC GCC AGC GTC CCC ACA ACG ACA ATA CGA CGC CAC GTC GAT TTG 288
Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His Val Asp Leu
85 90 95

CTC GTT GGG GCT GCT GCT TTC TGT TCC GCT ATG TAC GTG GGG GAT CTC 336
Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu
100 105 110

TGC GGA TCT GTT TTC CTT GTT TCC CAG CTG TTC ACC TTC TCA CCT CGC 384
Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Phe Ser Pro Arg
115 120 125

CGG CAT CAA ACA GTA CAG GAC TGC AAC TGC TCA ATC TAT CCC GGC CAT 432
Arg His Gln Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His
130 135 140

GTA TCA GGT CAC CGC ATG GCT TGG GAT ATG ATG ATG AAC TGG TCC TAATAG 483
Val Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser
145 150 155 160

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys
1 5 10 15

Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val
20 25 30

Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr Glu Ala
35 40 45

Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu
50 55 60

Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala
65 70 75 80

Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His Val Asp Leu
85 90 95

Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu
100 105 110

Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Phe Ser Pro Arg
115 120 125

Arg His Gln Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His
130 135 140

Val Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser
145 150 155



(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 480 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..477

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..474



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



... ... ... ... TCT TTC TCT ATC TTC CTC TTG GCC CTG CTG TCC TGT 48
Met Ser Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys
1 5 10 15

... ... ... ... GCT TCC GCT TAT GAA GTG CGC AAC GTG TCC GGG GTG 96
Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val
20 25 30

... ... GTC ACG AAC GAC TGC TCC AAC TCA AGC ATA GTG TAT GAG GCA 144
Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr Glu Ala
35 40 45

... ... ... ... ATG CAC ACC CCC GGG TGC GTG CCC TGC GTT CGG GAG 192
Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu
50 55 60

... ... ... ... CGT TGC TGG GTG GCG CTC ACT CCC ACG CTC GCG GCC 240
Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala
65 70 75 80

AGG AAC GCC AGC GTC CCC ACA ACG ACA ATA CGA CGC CAC GTC GAT TTG 288
Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His Val Asp Leu
85 90 95

CTC GTT GGG GCT GCT GCT TTC TGT TCC GCT ATG TAC GTG GGG GAT CTC 336
Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu
100 105 110

TGC GGA TCT GTT TTC CTT GTT TCC CAG CTG TTC ACC TTC TCA CCT CGC 384
Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Phe Ser Pro Arg
115 120 125

CGG CAT CAA ACA GTA CAG GAC TGC AAC TGC TCA ATC TAT CCC GGC CAT 432
Arg His Gln Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His
130 135 140

GTA TCA GGT CAC CGC ATG GCT TGG GAT ATG ATG ATG AAC TGG TAATAG 480
Val Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp
145 150 155

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 158 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Met Ser Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys
1 5 10 15

Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val
20 25 30

Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr Glu Ala
35 40 45

Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu
50 55 60

Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala
65 70 75 80

Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His Val Asp Leu
85 90 95

Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu
100 105 110

Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Phe Ser Pro Arg
115 120 125

Arg His Gln Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His
130 135 140

Val Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp
145 150 155

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..633

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..630

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

ATG CTG GGT AAG GCC ATC GAT ACC CTT ACG TGC GGC TTC GCC GAC CTC 48
Met Leu Gly Lys Ala Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96
Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAA GAC GGC GTG AAC TAT GCA 144
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

ACA GGG AAT TTG CCT GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTA 192
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

CTG TCC TGT CTA ACC ATT CCA GCT TCC GCT TAC GAG GTG CGC AAC GTG 240
Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336
Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

CTC GCG GCT AGG AAC GCC AGC ATC CCC ACT ACA ACA ATA CGA CGC CAC 432
Leu Ala Ala Arg Asn Ala Ser Ile Pro Thr Thr Thr Ile Arg Arg His
130 135 140

GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG 480
Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val
145 150 155 160

GGG GAT CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 528
Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Ile
165 170 175

TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 576
Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr
180 185 190

CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624
Pro Gly His Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
195 200 205

TGG TAC TAATAG 636
Trp Tyr
210

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 210 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Met Leu Gly Lys Ala Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

Leu Ser Cys Leu Thr Ile Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

Leu Ala Ala Arg Asn Ala Ser Ile Pro Thr Thr Thr Ile Arg Arg His
130 135 140

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val
145 150 155 160

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Ile
165 170 175

Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr
180 185 190

Pro Gly His Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
195 200 205

Trp Tyr
210

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

ATGCCCGGTT GCTCTTTCTC TATCTT 26 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
ATGTTGGGTA AGGTCATCGA TACCCT 26 
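
The two 26-base sequences above (SEQ ID NO: 15 and 16) both begin with ATG, and their in-frame codons can be compared with the opening residues of the proteins listed elsewhere in this listing. The following is a minimal sketch, not part of the filed listing. It assumes the standard genetic code and reading frame 1, and it ignores the trailing two bases that do not complete a codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate frame 1 of a DNA string into one-letter amino acid codes."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop an incomplete trailing codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


SEQ_ID_15 = "ATGCCCGGTT GCTCTTTCTC TATCTT"
SEQ_ID_16 = "ATGTTGGGTA AGGTCATCGA TACCCT"

print(translate(SEQ_ID_15))  # MPGCSFSI
print(translate(SEQ_ID_16))  # MLGKVIDT
```

In three-letter code these read Met Pro Gly Cys Ser Phe Ser Ile and Met Leu Gly Lys Val Ile Asp Thr, which match the first eight residues given for SEQ ID NO: 4 and SEQ ID NO: 6 respectively.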
(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: YES

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
CTATTAGGAC CAGTTCATCA TCATATCCCA 30

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: YES

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
CTATTACCAG TTCATCATCA TATCCCA 27

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
ATACGACGCC ACGTCGATTC CCAGCTGTTC ACCATC 36

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 




(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GATGGTGAAC AGCTGGGAAT CGACGTGGCG TCGTAT 36 
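
SEQ ID NO: 19 is marked ANTI-SENSE: NO and SEQ ID NO: 20 is marked ANTI-SENSE: YES, and both are 36 bases long. The following is a minimal sketch, not part of the filed listing, that checks whether the two are exact reverse complements of each other. The helper name `reverse_complement` is hypothetical, and only the unambiguous bases A, C, G and T are handled.

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string, ignoring spaces."""
    return dna.replace(" ", "").upper().translate(COMPLEMENT)[::-1]


SEQ_ID_19 = "ATACGACGCC ACGTCGATTC CCAGCTGTTC ACCATC"
SEQ_ID_20 = "GATGGTGAAC AGCTGGGAAT CGACGTGGCG TCGTAT"

# True when the antisense sequence pairs exactly with the sense sequence.
print(reverse_complement(SEQ_ID_20) == SEQ_ID_19.replace(" ", ""))
```

The comparison holds for the two sequences as printed here, so the pair describes the same 36-base region read from opposite strands.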
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 723 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(ix) FEATURE:
(A) NAME/KEY: CDS

(B) LOCATION: 1..720 



(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..717 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48
Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96
Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

... ... ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

... ... GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336
Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

... ... GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

... ... GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

... ... TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 480
Val Asp Ser Gln Leu Phe Thr Ile Ser Pro Arg Arg His Glu Thr Val
145 150 155 160

... ... TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 528
Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg
165 170 175

... ... TGG GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG 576
Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val
180 185 190

... ... CAG CTG CTC CGG ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG 624
Val Ser Gln Leu Leu Arg Ile Pro Gln Ala Val Val Asp Met Val Ala
195 200 205

... ... CAT TGG GGA GTC CTG GCG GGT CTC GCC TAC TAT TCC ATG GTG 672
Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val
210 215 220

... ... TGG GCT AAG GTT TTG ATT GTG ATG CTA CTC TTT GCT CCC TAATAG 723
Gly Asn Trp Ala Lys Val Leu Ile Val Met Leu Leu Phe Ala Pro
225 230 235 240

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 239 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

Val Asp Ser Gln Leu Phe Thr Ile Ser Pro Arg Arg His Glu Thr Val
145 150 155 160

Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg
165 170 175

Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val
180 185 190

Val Ser Gln Leu Leu Arg Ile Pro Gln Ala Val Val Asp Met Val Ala
195 200 205

Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val
210 215 220

Gly Asn Trp Ala Lys Val Leu Ile Val Met Leu Leu Phe Ala Pro
225 230 235

(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 561 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..558

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..555

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:



ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48
Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96
Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

??C GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336
Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

GTC GAT TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 480
Val Asp Ser Gln Leu Phe Thr Ile Ser Pro Arg Arg His Glu Thr Val
145 150 155 160

CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 528
Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg
165 170 175

ATG GCT TGG GAT ATG ATG ATG AAC TGG TAATAG 561
Met Ala Trp Asp Met Met Met Asn Trp
180 185

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

Val Asp Ser Gln Leu Phe Thr Ile Ser Pro Arg Arg His Glu Thr Val
145 150 155 160

Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg
165 170 175

Met Ala Trp Asp Met Met Met Asn Trp
180 185



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 606 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..603 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..600 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48
Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96
Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

??C GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336
Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

GTC GAT TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 480
Val Asp Ser Gln Leu Phe Thr Ile Ser Pro Arg Arg His Glu Thr Val
145 150 155 160

CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 528
Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg
165 170 175

ATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG 576
Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val
180 185 190

GTA TCG CAG CTG CTC CGG ATC CTC TAATAG 606
Val Ser Gln Leu Leu Arg Ile Leu
195 200



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

Val Asp Ser Gln Leu Phe Thr Ile Ser Pro Arg Arg His Glu Thr Val
145 150 155 160

Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg
165 170 175

Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val
180 185 190

Val Ser Gln Leu Leu Arg Ile Leu
195 200









(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..633

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..630 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48
Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96
Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

??C GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336
Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

GTC GAT TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 480
Val Asp Ser Gln Leu Phe Thr Ile Ser Pro Arg Arg His Glu Thr Val
145 150 155 160

CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 528
Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg
165 170 175

ATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG 576
Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val
180 185 190

GTA TCG CAG CTG CTC CGG ATC GTG ATC GAG GGC AGA CAC CAT CAC CAC 624
Val Ser Gln Leu Leu Arg Ile Val Ile Glu Gly Arg His His His His
195 200 205

CAT CAC TAATAG 636
His His
210



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

Val Asp Ser Gln Leu Phe Thr Ile Ser Pro Arg Arg His Glu Thr Val
145 150 155 160

Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg
165 170 175

Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val
180 185 190

Val Ser Gln Leu Leu Arg Ile Val Ile Glu Gly Arg His His His His
195 200 205

His His
210



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..627 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..624 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



ATG GGT AAG GTC ATC GAT ACC CTT ACG TGC GGA TTC GCC GAT CTC ATG 48
Met Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met
1 5 10 15

GGG TAC ATC CCG CTC GTC GGC GCT CCC GTA GGA GGC GTC GCA AGA GCC 96
Gly Tyr Ile Pro Leu Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala
20 25 30

CTT GCG CAT GGC GTG AGG GCC CTT GAA GAC GGG ATA AAT TTC GCA ACA 144
Leu Ala His Gly Val Arg Ala Leu Glu Asp Gly Ile Asn Phe Ala Thr
35 40 45

GGG AAT TTG CCC GGT TGC TCC TTT TCT ATT TTC CTT CTC GCT CTG TTC 192
Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Phe
50 55 60

TCT TGC TTA ATT CAT CCA GCA GCT AGT CTA GAG TGG CGG AAT ACG TCT 240
Ser Cys Leu Ile His Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser
65 70 75 80

GGC CTC TAT GTC CTT ACC AAC GAC TGT TCC AAT AGC AGT ATT GTG TAC 288
Gly Leu Tyr Val Leu Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr
85 90 95

GAG GCC GAT GAC GTT ATT CTG CAC ACA CCC GGC TGC ATA CCT TGT GTC 336
Glu Ala Asp Asp Val Ile Leu His Thr Pro Gly Cys Ile Pro Cys Val
100 105 110

CAG GAC GGC AAT ACA TCC ACG TGC TGG ACC CCA GTG ACA CCT ACA GTG 384
Gln Asp Gly Asn Thr Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val
115 120 125

GCA GTC AAG TAC GTC GGA GCA ACC ACC GCT TCG ATA CGC AGT CAT GTG 432
Ala Val Lys Tyr Val Gly Ala Thr Thr Ala Ser Ile Arg Ser His Val
130 135 140

GAC CTA TTA GTG GGC GCG GCC ACG ATG TGC TCT GCG CTC TAC GTG GGT 480
Asp Leu Leu Val Gly Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly
145 150 155 160

GAC ATG TGT GGG GCT GTC TTC CTC GTG GGA CAA GCC TTC ACG TTC AGA 528
Asp Met Cys Gly Ala Val Phe Leu Val Gly Gln Ala Phe Thr Phe Arg
165 170 175

CCT CGT CGC CAT CAA ACG GTC CAG ACC TGT AAC TGC TCG CTG TAC CCA 576
Pro Arg Arg His Gln Thr Val Gln Thr Cys Asn Cys Ser Leu Tyr Pro
180 185 190

GGC CAT CTT TCA GGA CAT CGA ATG GCT TGG GAT ATG ATG ATG AAC TGG 624
Gly His Leu Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp
195 200 205

TAATAG 630



(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 208 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

Met Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met
1 5 10 15

Gly Tyr Ile Pro Leu Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala
20 25 30

Leu Ala His Gly Val Arg Ala Leu Glu Asp Gly Ile Asn Phe Ala Thr
35 40 45

Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Phe
50 55 60

Ser Cys Leu Ile His Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser
65 70 75 80

Gly Leu Tyr Val Leu Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr
85 90 95

Glu Ala Asp Asp Val Ile Leu His Thr Pro Gly Cys Ile Pro Cys Val
100 105 110

Gln Asp Gly Asn Thr Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val
115 120 125

Ala Val Lys Tyr Val Gly Ala Thr Thr Ala Ser Ile Arg Ser His Val
130 135 140

Asp Leu Leu Val Gly Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly
145 150 155 160

Asp Met Cys Gly Ala Val Phe Leu Val Gly Gln Ala Phe Thr Phe Arg
165 170 175

Pro Arg Arg His Gln Thr Val Gln Thr Cys Asn Cys Ser Leu Tyr Pro
180 185 190

Gly His Leu Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp
195 200 205



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..627

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..624



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



ATG GGT AAG GTC ATC GAT ACC CTA ACG TGC GGA TTC GCC GAT CTC ATG 48
Met Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met
1 5 10 15

GGG TAT ATC CCG CTC GTA GGC GGC CCC ATT GGG GGC GTC GCA AGG GCT 96
Gly Tyr Ile Pro Leu Val Gly Gly Pro Ile Gly Gly Val Ala Arg Ala
20 25 30

CTC GCA CAC GGT GTG AGG GTC CTT GAG GAC GGG GTA AAC TAT GCA ACA 144
Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala Thr
35 40 45

GGG AAT TTA CCC GGT TGC TCT TTC TCT ATC TTT ATT CTT GCT CTT CTC 192
Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Ile Leu Ala Leu Leu
50 55 60

TCG TGT CTG ACC GTT CCG GCC TCT GCA GTT CCC TAC CGA AAT GCC TCT 240
Ser Cys Leu Thr Val Pro Ala Ser Ala Val Pro Tyr Arg Asn Ala Ser
65 70 75 80

GGG ATT TAT CAT GTT ACC AAT GAT TGC CCA AAC TCT TCC ATA GTC TAT 288
Gly Ile Tyr His Val Thr Asn Asp Cys Pro Asn Ser Ser Ile Val Tyr
85 90 95

GAG GCA GAT AAC CTG ATC CTA CAC GCA CCT GGT TGC GTG CCT TGT GTC 336
Glu Ala Asp Asn Leu Ile Leu His Ala Pro Gly Cys Val Pro Cys Val
100 105 110

ATG ACA GGT AAT GTG AGT AGA TGC TGG GTC CAA ATT ACC CCT ACA CTG 384
Met Thr Gly Asn Val Ser Arg Cys Trp Val Gln Ile Thr Pro Thr Leu
115 120 125

TCA GCC CCG AGC CTC GGA GCA GTC ACG GCT CCT CTT CGG AGA GCC GTT 432
Ser Ala Pro Ser Leu Gly Ala Val Thr Ala Pro Leu Arg Arg Ala Val
130 135 140

GAC TAC CTA GCG GGA GGG GCT GCC CTC TGC TCC GCG TTA TAC GTA GGA 480
Asp Tyr Leu Ala Gly Gly Ala Ala Leu Cys Ser Ala Leu Tyr Val Gly
145 150 155 160

GAC GCG TGT GGG GCA CTA TTC TTG GTA GGC CAA ATG TTC ACC TAT AGG 528
Asp Ala Cys Gly Ala Leu Phe Leu Val Gly Gln Met Phe Thr Tyr Arg
165 170 175

CC? CGC CAG CAC GCT ACG GTG CAG AAC TGC AAC TGT TCC ATT TAC AGT 576
Pro Arg Gln His Ala Thr Val Gln Asn Cys Asn Cys Ser Ile Tyr Ser
180 185 190

GGC CAT GTT ACC GGC CAC CGG ATG GCA TGG GAT ATG ATG ATG AAC TGG 624
Gly His Val Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp
195 200 205

TAATAG 630


(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met
1 5 10 15

Gly Tyr Ile Pro Leu Val Gly Gly Pro Ile Gly Gly Val Ala Arg Ala
20 25 30

Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala Thr
35 40 45

Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Ile Leu Ala Leu Leu
50 55 60

Ser Cys Leu Thr Val Pro Ala Ser Ala Val Pro Tyr Arg Asn Ala Ser
65 70 75 80

Gly Ile Tyr His Val Thr Asn Asp Cys Pro Asn Ser Ser Ile Val Tyr
85 90 95

Glu Ala Asp Asn Leu Ile Leu His Ala Pro Gly Cys Val Pro Cys Val
100 105 110

Met Thr Gly Asn Val Ser Arg Cys Trp Val Gln Ile Thr Pro Thr Leu
115 120 125

Ser Ala Pro Ser Leu Gly Ala Val Thr Ala Pro Leu Arg Arg Ala Val
130 135 140

Asp Tyr Leu Ala Gly Gly Ala Ala Leu Cys Ser Ala Leu Tyr Val Gly
145 150 155 160

Asp Ala Cys Gly Ala Leu Phe Leu Val Gly Gln Met Phe Thr Tyr Arg
165 170 175

Pro Arg Gln His Ala Thr Val Gln Asn Cys Asn Cys Ser Ile Tyr Ser
180 185 190

Gly His Val Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp
195 200 205


(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

TGGGATATGA TGATGAACTG GTC 23 

(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:

CTATTATGGT GGTAAGCCAC AGAGCAGGAG 30

(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1476 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1473

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..1470

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:



TGG GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG GTA TCG 48
Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser
1 5 10 15

CAG CTG CTC CGG ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC 96
Gln Leu Leu Arg Ile Pro Gln Ala Val Val Asp Met Val Ala Gly Ala
20 25 30

CAT TGG GGA GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC 144
His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn
35 40 45

TGG GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG 192
Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly
50 55 60

CAT ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT 240
His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu
65 70 75 80

GTG TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC 288
Val Ser Leu Phe Ser Pro Gly Ser Ala Gln Lys Ile Gln Leu Val Asn
85 90 95

ACC AAC GGC ACT TGG CAC ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC 336
Thr Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp
100 105 110

TCC CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC TAC AAA CAC AAA TTC 384
Ser Leu Gln Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe
115 120 125

AAC TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC 432
Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser Ile Asp
130 135 140

AAG TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC ACT GAG CCT AAC AGC 480
Lys Phe Ala Gln Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser
145 150 155 160

TCG GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT 528
Ser Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly
165 170 175

ATT GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG TAT TGC TTC ACC CCG 576
Ile Val Pro Ala Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro
180 185 190

AGC CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT GGT GTC CCC ACG TAT 624
Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr
195 200 205

AAC TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT CTC AAC AAC ACG CGG 672
Asn Trp Gly Ala Asn Asp Ser Asp Val Leu Ile Leu Asn Asn Thr Arg
210 215 220

CCG CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG ATG AAT GGC ACT GGG 720
Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly
225 230 235 240

TTC ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC ATC GGG GGG GCC GGC 768
Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Ala Gly
245 250 255

AAC AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT CGG AAG CAC CCC GAG 816
Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu
260 265 270

GCC ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG CTG ACA CCT AGG TGT 864
Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys
275 280 285

ATG GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC CCC TGC ACT GTC AAC 912
Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn
290 295 300

TTC ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG GGC GTG GAG CAC AGG 960
Phe Thr Ile Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg
305 310 315 320

TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT TGT GAC TTG GAG 1008
Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu
325 330 335

GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT ACA ACA GAG 1056
Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu
340 345 350

TGG CAG ATA CTG CCC TGT TCC TTC ACC ACC CTG CCG GCC CTA TCC ACC 1104
Trp Gln Ile Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr
355 360 365

GG? CTG ATC CAC CTC CAT CAG AAC ATC GTG GAC GTG CAA TAC CTG TAC 1152
Gly Leu Ile His Leu His Gln Asn Ile Val Asp Val Gln Tyr Leu Tyr
370 375 380

GGT GTA GGG TCG GCG GTT GTC TCC CTT GTC ATC AAA TGG GAG TAT GTC 1200
Gly Val Gly Ser Ala Val Val Ser Leu Val Ile Lys Trp Glu Tyr Val
385 390 395 400

CTG TTG CTC TTC CTT CTC CTG GCA GAC GCG CGC ATC TGC GCC TGC TTA 1248
Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Ile Cys Ala Cys Leu
405 410 415

TGG ATG ATG CTG CTG ATA GCT CAA GCT GAG GCC GCC TTA GAG AAC CTG 1296
Trp Met Met Leu Leu Ile Ala Gln Ala Glu Ala Ala Leu Glu Asn Leu
420 425 430

GTG GTC CTC AAT GCG GCG GCC GTG GCC GGG GCG CAT GGC ACT CTT TCC 1344
Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala His Gly Thr Leu Ser
435 440 445

TTC CTT GTG TTC TTC TGT GCT GCC TGG TAC ATC AAG GGC AGG CTG GTC 1392
Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr Ile Lys Gly Arg Leu Val
450 455 460

CCT GGT GCG GCA TAC GCC TTC TAT GGC GTG TGG CCG CTG CTC CTG CTT 1440
Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu
465 470 475 480

?TG CTG GCC TTA CCA CCA CGA GCT TAT GCC TAGTAA 1476
Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala
485 490


(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 490 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser
1 5 10 15

Gln Leu Leu Arg Ile Pro Gln Ala Val Val Asp Met Val Ala Gly Ala
20 25 30

His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn
35 40 45

Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly
50 55 60

His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu
65 70 75 80

Val Ser Leu Phe Ser Pro Gly Ser Ala Gln Lys Ile Gln Leu Val Asn
85 90 95

Thr Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp
100 105 110

Ser Leu Gln Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe
115 120 125

Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser Ile Asp
130 135 140

Lys Phe Ala Gln Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser
145 150 155 160

Ser Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly
165 170 175

Ile Val Pro Ala Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro
180 185 190

Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr
195 200 205

Asn Trp Gly Ala Asn Asp Ser Asp Val Leu Ile Leu Asn Asn Thr Arg
210 215 220

Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly
225 230 235 240

Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Ala Gly
245 250 255

Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu
260 265 270

Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys
275 280 285

Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn
290 295 300

Phe Thr Ile Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg
305 310 315 320

Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu
325 330 335

Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu
340 345 350

Trp Gln Ile Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr
355 360 365

Gly Leu Ile His Leu His Gln Asn Ile Val Asp Val Gln Tyr Leu Tyr
370 375 380

Gly Val Gly Ser Ala Val Val Ser Leu Val Ile Lys Trp Glu Tyr Val
385 390 395 400

Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Ile Cys Ala Cys Leu
405 410 415

Trp Met Met Leu Leu Ile Ala Gln Ala Glu Ala Ala Leu Glu Asn Leu
420 425 430

Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala His Gly Thr Leu Ser
435 440 445

Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr Ile Lys Gly Arg Leu Val
450 455 460

Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu
465 470 475 480

Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala
485 490

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 2..1018

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 2..1015



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



G ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA 4 6 

He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly 
15 10 15 

GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG 94 
Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys 
20 25 30 

GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT ACC CGC 142 
Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg 
35 40 45 

GTG TCA GGA GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG TCC CTC 190 
Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu 
50 55 60 

TTT AGC CCC GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC ACC AAC GGC 238 
Phe Ser Pro Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr Asn Gly 
65 70 75 

AGT TGG CAC ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC TCC CTC CAA 286 
Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gln 
80 85 90 95 

ACA GGG TTC TTT GCC GCA CTA TTC TAC AAA CAC AAA TTC AAC TCG TCT 334 
Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser 
100 105 110 

GGA TGC CCA GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC AAG TTC GCT 382 
Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser Ile Asp Lys Phe Ala 
115 120 125 

CAG GGG TGG GGT CCC CTC ACT TAC ACT GAG CCT AAC AGC TCG GAC CAG 430 
Gln Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gln 
130 135 140 

AGG CCC TAC TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT ATT GTA CCC 478 
Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly Ile Val Pro 
145 150 155 

GCG TCT CAG GTG TGC GGT CCA GTG TAT TGC TTC ACC CCG AGC CCT GTT 526 
Ala Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val 
160 165 170 175 

GTG GTG GGG ACG ACC GAT CGG TTT GGT GTC CCC ACG TAT AAC TGG GGG 574 
Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly 
180 185 190 

GCG AAC GAC TCG GAT GTG CTG ATT CTC AAC AAC ACG CGG CCG CCG CGA 622 
Ala Asn Asp Ser Asp Val Leu Ile Leu Asn Asn Thr Arg Pro Pro Arg 
195 200 205 

GGC AAC TGG TTC GGC TGT ACA TGG ATG AAT GGC ACT GGG TTC ACC AAG 670 
Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys 
210 215 220 

ACG TGT GGG GGC CCC CCG TGC AAC ATC GGG GGG GCC GGC AAC AAC ACC 718 
Thr Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Ala Gly Asn Asn Thr 
225 230 235 

TTG ACC TGC CCC ACT GAC TGT TTT CGG AAG CAC CCC GAG GCC ACC TAC 766 
Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr 
240 245 250 255 

GCC AGA TGC GGT TCT GGG CCC TGG CTG ACA CCT AGG TGT ATG GTT CAT 814 
Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His 
260 265 270 

TAC CCA TAT AGG CTC TGG CAC TAC CCC TGC ACT GTC AAC TTC ACC ATC 862 
Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr Ile 
275 280 285 

TTC AAG GTT AGG ATG TAC GTG GGG GGC GTG GAG CAC AGG TTC GAA GCC 910 
Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala 
290 295 300 

GCA TGC AAT TGG ACT CGA GGA GAG CGT TGT GAC TTG GAG GAC AGG GAT 958 
Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp 
305 310 315 

AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT ACA ACA GAG TGG CAG AGT 1006 
Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gln Ser 
320 325 330 335 

GGC AGA GCT TAATTA 1021 
Gly Arg Ala 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 338 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Ile Pro Gln Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val 
1 5 10 15 

Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val 
20 25 30 

Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val 
35 40 45 

Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe 
50 55 60 

Ser Pro Gly Ser Ala Gln Lys Ile Gln Leu Val Asn Thr Asn Gly Ser 
65 70 75 80 

Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gln Thr 
85 90 95 

Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly 
100 105 110 

Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser Ile Asp Lys Phe Ala Gln 
115 120 125 

Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gln Arg 
130 135 140 

Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly Ile Val Pro Ala 
145 150 155 160 

Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val 
165 170 175 

Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly Ala 
180 185 190 

Asn Asp Ser Asp Val Leu Ile Leu Asn Asn Thr Arg Pro Pro Arg Gly 
195 200 205 

Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr 
210 215 220 

Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Ala Gly Asn Asn Thr Leu 
225 230 235 240 

Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ala 
245 250 255 

Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr 
260 265 270 

Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr Ile Phe 
275 280 285 

Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala 
290 295 300 

Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg 
305 310 315 320 

Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gln Ser Gly 
325 330 335 

Arg Ala 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1034 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..1032 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 2..1029 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

G ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA 46 
Ile Pro Gln Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly 
1 5 10 15 

GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG 94 
Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys 
20 25 30 

GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT ACC CGC 142 
Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg 
35 40 45 

GTG TCA GGA GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG TCC CTC 190 
Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu 
50 55 60 

TTT AGC CCC GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC ACC AAC GGC 238 
Phe Ser Pro Gly Ser Ala Gln Lys Ile Gln Leu Val Asn Thr Asn Gly 
65 70 75 

AGT TGG CAC ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC TCC CTC CAA 286 
Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gln 
80 85 90 95 

ACA GGG TTC TTT GCC GCA CTA TTC TAC AAA CAC AAA TTC AAC TCG TCT 334 
Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser 
100 105 110 

GGA TGC CCA GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC AAG TTC GCT 382 
Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser Ile Asp Lys Phe Ala 
115 120 125 

CAG GGG TGG GGT CCC CTC ACT TAC ACT GAG CCT AAC AGC TCG GAC CAG 430 
Gln Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gln 
130 135 140 

AGG CCC TAC TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT ATT GTA CCC 478 
Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly Ile Val Pro 
145 150 155 

GCG TCT CAG GTG TGC GGT CCA GTG TAT TGC TTC ACC CCG AGC CCT GTT 526 
Ala Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val 
160 165 170 175 

GTG GTG GGG ACG ACC GAT CGG TTT GGT GTC CCC ACG TAT AAC TGG GGG 574 
Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly 
180 185 190 

GCG AAC GAC TCG GAT GTG CTG ATT CTC AAC AAC ACG CGG CCG CCG CGA 622 
Ala Asn Asp Ser Asp Val Leu Ile Leu Asn Asn Thr Arg Pro Pro Arg 
195 200 205 

GGC AAC TGG TTC GGC TGT ACA TGG ATG AAT GGC ACT GGG TTC ACC AAG 670 
Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys 
210 215 220 

ACG TGT GGG GGC CCC CCG TGC AAC ATC GGG GGG GCC GGC AAC AAC ACC 718 
Thr Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Ala Gly Asn Asn Thr 
225 230 235 

TTG ACC TGC CCC ACT GAC TGT TTT CGG AAG CAC CCC GAG GCC ACC TAC 766 
Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr 
240 245 250 255 

GCC AGA TGC GGT TCT GGG CCC TGG CTG ACA CCT AGG TGT ATG GTT CAT 814 
Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His 
260 265 270 

TAC CCA TAT AGG CTC TGG CAC TAC CCC TGC ACT GTC AAC TTC ACC ATC 862 
Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr Ile 
275 280 285 

TTC AAG GTT AGG ATG TAC GTG GGG GGC GTG GAG CAC AGG TTC GAA GCC 910 
Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala 
290 295 300 

GCA TGC AAT TGG ACT CGA GGA GAG CGT TGT GAC TTG GAG GAC AGG GAT 958 
Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp 
305 310 315 

AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT ACA ACA GGT GAT CGA GGG 1006 
Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Gly Asp Arg Gly 
320 325 330 335 

CAG ACA CCA TCA CCA CCA TCA CTA ATAG 1034 
Gln Thr Pro Ser Pro Pro Ser Leu 
340 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 343 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Ile Pro Gln Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val 
1 5 10 15 

Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val 
20 25 30 

Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val 
35 40 45 

Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe 
50 55 60 

Ser Pro Gly Ser Ala Gln Lys Ile Gln Leu Val Asn Thr Asn Gly Ser 
65 70 75 80 

Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gln Thr 
85 90 95 

Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly 
100 105 110 

Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser Ile Asp Lys Phe Ala Gln 
115 120 125 

Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gln Arg 
130 135 140 

Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly Ile Val Pro Ala 
145 150 155 160 

Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val 
165 170 175 

Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly Ala 
180 185 190 

Asn Asp Ser Asp Val Leu Ile Leu Asn Asn Thr Arg Pro Pro Arg Gly 
195 200 205 

Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr 
210 215 220 

Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Ala Gly Asn Asn Thr Leu 
225 230 235 240 

Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ala 
245 250 255 

Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr 
260 265 270 

Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr Ile Phe 
275 280 285 

Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala 
290 295 300 

Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg 
305 310 315 320 

Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Gly Asp Arg Gly Gln 
325 330 335 

Thr Pro Ser Pro Pro Ser Leu 
340 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 945 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..942 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..939 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC 48 
Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
1 5 10 15 

GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT 96 
Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp 
20 25 30 

ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA ATC 144 
Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gln Lys Ile 
35 40 45 

CAG CTC GTA AAC ACC AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC CTG 192 
Gln Leu Val Asn Thr Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu 
50 55 60 

AAC TGC AAC GAC TCC CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC TAC 240 
Asn Cys Asn Asp Ser Leu Gln Thr Gly Phe Phe Ala Ala Leu Phe Tyr 
65 70 75 80 

AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC TGT 288 
Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
85 90 95 

CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC ACT 336 
Arg Ser Ile Asp Lys Phe Ala Gln Gly Trp Gly Pro Leu Thr Tyr Thr 
100 105 110 

GAG CCT AAC AGC TCG GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG CCT 384 
Glu Pro Asn Ser Ser Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro 
115 120 125 

CGA CCG TGT GGT ATT GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG TAT 432 
Arg Pro Cys Gly Ile Val Pro Ala Ser Gln Val Cys Gly Pro Val Tyr 
130 135 140 

TGC TTC ACC CCG AGC CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT GGT 480 
Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly 
145 150 155 160 

GTC CCC ACG TAT AAC TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT CTC 528 
Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu Ile Leu 
165 170 175 

AAC AAC ACG CGG CCG CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG ATG 576 
Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met 
180 185 190 

AAT GGC ACT GGG TTC ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC ATC 624 
Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile 
195 200 205 

GGG GGG GCC GGC AAC AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT CGG 672 
Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg 
210 215 220 

AAG CAC CCC GAG GCC ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG CTG 720 
Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu 
225 230 235 240 

ACA CCT AGG TGT ATG GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC CCC 768 
Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro 
245 250 255 

TGC ACT GTC AAC TTC ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG GGC 816 
Cys Thr Val Asn Phe Thr Ile Phe Lys Val Arg Met Tyr Val Gly Gly 
260 265 270 

GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT 864 
Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
275 280 285 

TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG 912 
Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 
290 295 300 

TCT ACA ACA GAG TGG CAG AGC TTA ATT AAT TAG 945 
Ser Thr Thr Glu Trp Gln Ser Leu Ile Asn 
305 310 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
1 5 10 15 

Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp 
20 25 30 

Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gln Lys Ile 
35 40 45 

Gln Leu Val Asn Thr Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu 
50 55 60 

Asn Cys Asn Asp Ser Leu Gln Thr Gly Phe Phe Ala Ala Leu Phe Tyr 
65 70 75 80 

Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
85 90 95 

Arg Ser Ile Asp Lys Phe Ala Gln Gly Trp Gly Pro Leu Thr Tyr Thr 
100 105 110 

Glu Pro Asn Ser Ser Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro 
115 120 125 

Arg Pro Cys Gly Ile Val Pro Ala Ser Gln Val Cys Gly Pro Val Tyr 
130 135 140 

Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly 
145 150 155 160 

Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu Ile Leu 
165 170 175 

Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met 
180 185 190 

Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile 
195 200 205 

Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg 
210 215 220 

Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu 
225 230 235 240 

Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro 
245 250 255 

Cys Thr Val Asn Phe Thr Ile Phe Lys Val Arg Met Tyr Val Gly Gly 
260 265 270 

Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
275 280 285 

Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 
290 295 300 

Ser Thr Thr Glu Trp Gln Ser Leu Ile Asn 
305 310 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 961 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..958 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..955 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC 48 
Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
1 5 10 15 

GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT 96 
Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp 
20 25 30 

ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA ATC 144 
Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gln Lys Ile 
35 40 45 

CAG CTC GTA AAC ACC AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC CTG 192 
Gln Leu Val Asn Thr Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu 
50 55 60 

AAC TGC AAC GAC TCC CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC TAC 240 
Asn Cys Asn Asp Ser Leu Gln Thr Gly Phe Phe Ala Ala Leu Phe Tyr 
65 70 75 80 

AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC TGT 288 
Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
85 90 95 

CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC ACT 336 
Arg Ser Ile Asp Lys Phe Ala Gln Gly Trp Gly Pro Leu Thr Tyr Thr 
100 105 110 

GAG CCT AAC AGC TCG GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG CCT 384 
Glu Pro Asn Ser Ser Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro 
115 120 125 

CGA CCG TGT GGT ATT GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG TAT 432 
Arg Pro Cys Gly Ile Val Pro Ala Ser Gln Val Cys Gly Pro Val Tyr 
130 135 140 

TGC TTC ACC CCG AGC CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT GGT 480 
Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly 
145 150 155 160 

GTC CCC ACG TAT AAC TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT CTC 528 
Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu Ile Leu 
165 170 175 

AAC AAC ACG CGG CCG CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG ATG 576 
Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met 
180 185 190 

AAT GGC ACT GGG TTC ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC ATC 624 
Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile 
195 200 205 

GGG GGG GCC GGC AAC AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT CGG 672 
Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg 
210 215 220 

AAG CAC CCC GAG GCC ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG CTG 720 
Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu 
225 230 235 240 

ACA CCT AGG TGT ATG GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC CCC 768 
Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro 
245 250 255 

TGC ACT GTC AAC TTC ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG GGC 816 
Cys Thr Val Asn Phe Thr Ile Phe Lys Val Arg Met Tyr Val Gly Gly 
260 265 270 

GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT 864 
Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
275 280 285 

TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG 912 
Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 
290 295 300 

TCT ACA ACA GGT GAT CGA GGG CAG ACA CCA TCA CCA CCA TCA CTA A 958 
Ser Thr Thr Gly Asp Arg Gly Gln Thr Pro Ser Pro Pro Ser Leu 
305 310 315 

TAG 961 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 319 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
1 5 10 15 

Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp 
20 25 30 

Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gln Lys Ile 
35 40 45 

Gln Leu Val Asn Thr Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu 
50 55 60 

Asn Cys Asn Asp Ser Leu Gln Thr Gly Phe Phe Ala Ala Leu Phe Tyr 
65 70 75 80 

Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
85 90 95 

Arg Ser Ile Asp Lys Phe Ala Gln Gly Trp Gly Pro Leu Thr Tyr Thr 
100 105 110 

Glu Pro Asn Ser Ser Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro 
115 120 125 

Arg Pro Cys Gly Ile Val Pro Ala Ser Gln Val Cys Gly Pro Val Tyr 
130 135 140 

Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly 
145 150 155 160 

Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu Ile Leu 
165 170 175 

Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met 
180 185 190 

Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile 
195 200 205 

Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg 
210 215 220 

Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu 
225 230 235 240 

Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro 
245 250 255 

Cys Thr Val Asn Phe Thr Ile Phe Lys Val Arg Met Tyr Val Gly Gly 
260 265 270 

Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
275 280 285 

Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 
290 295 300 

Ser Thr Thr Gly Asp Arg Gly Gln Thr Pro Ser Pro Pro Ser Leu 
305 310 315 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1395 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1392 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..1389 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG GGC CTC GCC TAC TAT 48 
Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr 
1 5 10 15 

TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT 96 
Ser Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe 
20 25 30 

GCC GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC 144 
Ala Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser 
35 40 45 

GAT ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA 192 
Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gln Lys 
50 55 60 

ATC CAG CTC GTA AAC ACC AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC 240 
Ile Gln Leu Val Asn Thr Asn Gly Ser Trp His Ile Asn Arg Thr Ala 
65 70 75 80 

CTG AAC TGC AAC GAC TCC CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC 288 
Leu Asn Cys Asn Asp Ser Leu Gln Thr Gly Phe Phe Ala Ala Leu Phe 
85 90 95 

TAC AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC 336 
Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser 
100 105 110 

TGT CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC 384 
Cys Arg Ser Ile Asp Lys Phe Ala Gln Gly Trp Gly Pro Leu Thr Tyr 
115 120 125 

ACT GAG CCT AAC AGC TCG GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG 432 
Thr Glu Pro Asn Ser Ser Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala 
130 135 140 

CCT CGA CCG TGT GGT ATT GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG 480 
Pro Arg Pro Cys Gly Ile Val Pro Ala Ser Gln Val Cys Gly Pro Val 
145 150 155 160 

TAT TGC TTC ACC CCG AGC CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT 528 
Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe 
165 170 175 

GGT GTC CCC ACG TAT AAC TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT 576 
Gly Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu Ile 
180 185 190 

CTC AAC AAC ACG CGG CCG CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG 624 
Leu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp 
195 200 205 

ATG AAT GGC ACT GGG TTC ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC 672 
Met Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn 
210 215 220 

ATC GGG GGG GCC GGC AAC AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT 720 
Ile Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe 
225 230 235 240 

CGG AAG CAC CCC GAG GCC ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG 768 
Arg Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp 
245 250 255 

CTG ACA CCT AGG TGT ATG GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC 816 
Leu Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr 
260 265 270 

CCC TGC ACT GTC AAC TTC ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG 864 
Pro Cys Thr Val Asn Phe Thr Ile Phe Lys Val Arg Met Tyr Val Gly 
275 280 285 

GGC GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG 912 
Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu 
290 295 300 

CGT TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG 960 
Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu 
305 310 315 320 

CTG TCT ACA ACA GAG TGG CAG ATA CTG CCC TGT TCC TTC ACC ACC CTG 1008 
Leu Ser Thr Thr Glu Trp Gln Ile Leu Pro Cys Ser Phe Thr Thr Leu 
325 330 335 

CCG GCC CTA TCC ACC GGC CTG ATC CAC CTC CAT CAG AAC ATC GTG GAC 1056 
Pro Ala Leu Ser Thr Gly Leu Ile His Leu His Gln Asn Ile Val Asp 
340 345 350 

GTG CAA TAC CTG TAC GGT GTA GGG TCG GCG GTT GTC TCC CTT GTC ATC 1104 
Val Gln Tyr Leu Tyr Gly Val Gly Ser Ala Val Val Ser Leu Val Ile 
355 360 365 

AAA TGG GAG TAT GTC CTG TTG CTC TTC CTT CTC CTG GCA GAC GCG CGC 1152 
Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg 
370 375 380 

ATC TGC GCC TGC TTA TGG ATG ATG CTG CTG ATA GCT CAA GCT GAG GCC 1200 
Ile Cys Ala Cys Leu Trp Met Met Leu Leu Ile Ala Gln Ala Glu Ala 
385 390 395 400 

TTA GAG AAC CTG GTG GTC CTC AAT GCG GCG GCC GTG GCC GGG GCG 1248 
Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala 
405 410 415 

GGC ACT CTT TCC TTC CTT GTG TTC TTC TGT GCT GCC TGG TAC ATC 1296 
His Gly Thr Leu Ser Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr Ile 
420 425 430 

AAG GGC AGG CTG GTC CCT GGT GCG GCA TAC GCC TTC TAT GGC GTG TGG 1344 
Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp 
435 440 445 

CCG CTG CTC CTG CTT CTG CTG GCC TTA CCA CCA CGA GCT TAT GCC TAGTAA 1395 
Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
450 455 460 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 463 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr 
1 5 10 15 

Ser Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe 
20 25 30 

Ala Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser 
35 40 45 

Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gln Lys 
50 55 60 

Ile Gln Leu Val Asn Thr Asn Gly Ser Trp His Ile Asn Arg Thr Ala 
65 70 75 80 

Leu Asn Cys Asn Asp Ser Leu Gln Thr Gly Phe Phe Ala Ala Leu Phe 
85 90 95 

Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser 
100 105 110 

Cys Arg Ser Ile Asp Lys Phe Ala Gln Gly Trp Gly Pro Leu Thr Tyr 
115 120 125 

Thr Glu Pro Asn Ser Ser Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala 
130 135 140 



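Pro Arg Pro Cys Gly Ile Val Pro Ala Ser Gln Val Cys Gly Pro Val 
145 150 155 160 

Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe 
165 170 175 

Gly Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu Ile 
180 185 190 

Leu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp 
195 200 205 

Met Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn 
210 215 220 

Ile Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe 
225 230 235 240 

Arg Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp 
245 250 255 

Leu Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr 
260 265 270 

Pro Cys Thr Val Asn Phe Thr Ile Phe Lys Val Arg Met Tyr Val Gly 
275 280 285 

Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu 
290 295 300 

Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu 
305 310 315 320 

Leu Ser Thr Thr Glu Trp Gln Ile Leu Pro Cys Ser Phe Thr Thr Leu 
325 330 335 

Pro Ala Leu Ser Thr Gly Leu Ile His Leu His Gln Asn Ile Val Asp 
340 345 350 

Val Gln Tyr Leu Tyr Gly Val Gly Ser Ala Val Val Ser Leu Val Ile 
355 360 365 

Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg 
370 375 380 

Ile Cys Ala Cys Leu Trp Met Met Leu Leu Ile Ala Gln Ala Glu Ala 
385 390 395 400 

Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala 
405 410 415 

His Gly Thr Leu Ser Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr Ile 
420 425 430 

Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp 
435 440 445 

Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
450 455 460 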



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2082 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
(B) LOCATION: 1..2079 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 
(B) LOCATION: 1..2076 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

AAT TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48 
Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
1 5 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 
Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240 
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val 
85 90 95 

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 
Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432 
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His 
130 135 140 

GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG 480 
Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

GGG GAC CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 528 
Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Ile 
165 170 175 

TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 576 
Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr 
180 185 190 

CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624 
Pro Gly His Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
195 200 205 

TGG TCG CCT ACA ACG GCC CTG GTG GTA TCG CAG CTG CTC CGG ATC CCA 672 
Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gln Leu Leu Arg Ile Pro 
210 215 220 

CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG 720 
Gln Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala 
225 230 235 240 

GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT 768 
Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Val 
245 250 255 

GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA 816 
Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val Ser Gly 
260 265 270 

GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC 864 
Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro 
275 280 285 

GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC ACC AAC GGC AGT TGG CAC 912 
Gly Ser Ala Gln Lys Ile Gln Leu Val Asn Thr Asn Gly Ser Trp His 
290 295 300 

ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC TCC CTC CAA ACA GGG TTC 960 
Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gln Thr Gly Phe 
305 310 315 320 

TTT GCC GCA CTA TTC TAC AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA 1008 
Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro 
325 330 335 

GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG 1056 
Glu Arg Leu Ala Ser Cys Arg Ser Ile Asp Lys Phe Ala Gln Gly Trp 
340 345 350 

GGT CCC CTC ACT TAC ACT GAG CCT AAC AGC TCG GAC CAG AGG CCC TAC 1104 
Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gln Arg Pro Tyr 
355 360 365 

TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT ATT GTA CCC GCG TCT CAG 1152 


Cys Trp 


His 


Tyr 


AX a 


fro Aig 


Pro Cys 


Gly 


Tie Vai Pro Ala 


Ser 


Gin 




370 








^ f J 








380 








GTG TGC 


GGT 


CCA 


GTG 


lAi ivjC 


TTC 


ACC 


CCG 


AGC CCT GTT GTG 


GTG 


GGG 


1200 


Vai Cys 


Gly 


Ova 

Pro 


vax 


Tyr ^ys 


Phe 


Thr 


Pro 


Ser Pro Vai Vai 


Vai 


Glv 




385 














395 

<j ^ -J 




400 




ACG ACC 


GAT 


CGG 


TTT 


Gu i u i 


CCC 


ACG 


TAT 


AAC TGG GGG GCG 

rv^w * WW www www 


AAC 


GAC 


1248 


Thr Thr 


Asp 


Arg 


It liC 


Gly Vai 


Pro 


Thr Tyr 


Asn Trp Gly Ala 


Asn 


Asp 






405 








410 




415 






%G GAT 


GTG 


CTG 


ATT 


CTC AAC 


AAC 


ACG 


CGG 


CCG CCG CGA GGC 


AAC 


TGG 


1296 


Asp 


vai 


Leu 


xxe 


Leu Asn 


Asn 


Thr Arg 


Pro Pro Ara Glv 


Asn 


Tro 








420 








425 




430 
































IRTC GGC 


TGT 


ACA 


TGG 


ATG AAT 


GGC 


ACT 


GGG 


TTC ACC AAf^ ACQ 
1 1 Vw nww nr\\j f\\^\3 


TGT 

X w X 


GGG 


1344 


ipie Gly 

nj 


Cys 


Tiir 


Trp 


Met Asn Gly Thr Gly 


Phe Thr Lvs Thr 


Cvs 

WJf J 


Glv 

W^ 




435 








440 






445 








s p 

g6c CCC 


CCG 


TGC 


AAC 


ATC GGG 


GGG 


GCC 


GGC 


AAC AAC ACC TTG 


ACC 


TGC 


1392 


Bly Pro 


Pro 


Cys 


Asn 


He Gly Gly Ala Gly 


A^n Asn Thr Leu 


Thr 


Cys 




450 






455 








4 60 








□ 
























pec ACT 


GAC 


TGT 


TTT 


CGG AAG 


CAC 


CCC 


GAG 


GCC ACC TAC GCC 


AGA 


TGC 


1440 




Asp 


Cys 


rne 


Arg Lys 


His 


Pro 


Glu 


Ala Thr Tvr Ala 


Arg 


Cys 




^o Thr 




470 








475 




480 




b'GT TCT 


GGG 


CCC 


TGG 


CTG ACA 


CCT 


AGG 


TGT 


ATG GTT CAT TAC 


CCA 


TAT 


1488 


Gly Ser 


Gly 


Pro 


Trp 


Leu Thr 


Pro Arg Cys 


Mef Vai Hi<5 Tvr 
LTic u vox nxo ^ jr 


Pro 


Tvr 






0 J 








490 




495* 






AGG CTC 


TGG 


CAC 


TAC 


CCC TGC 


ACT 


GTC 


AAC 


TTC ACC ATC TTC 


AAG 


GTT 


1536 


Arg Leu 


Trp 


His 


Tyr 


Pro Cys 


Thr 


Vai 


Asn 


Phe Thr Tie Phe 

Cilw XilX XX^ C^liw 


Lys 


Val 




500 








505 




510 








AGG ATG 


TAC 


GTG 


GGG 


GGC GTG 


GAG 


CAC 


AGG 


TTC GAA GCC GCA 


TGC 


AAT 


1584 


Arg Met 


Tyr 


vai 


Gxy 


Gly Vai 


Glu 


His Arg 


Phe Glu Ala Ala 

1 1 w w X u c\x a nx u 


Cys Asn 




C 1 c 

515 








520 






525 








TGG ACT 


CGA 


GGA 


GAG 


CGT TGT 


GAC 


TTG 


GAG 


GAC AGG GAT AGA 


TCA 


GAG 


. 1632 


Trp Thr 


Arg 


bxy 


Pin 
uXU 


Arg Cys 


Asp 


Leu 


Glu 


AsD Ara Asd Arcr 


Ser 


Glu 




530 




535 








540 








CTT AGC 


CCG 


CTG 


CTG 


CTG TCT 


ACA 


ACA 


GAG 


TGG CAG ATA CTG 


CCC 


TGT 


1680 


Leu Ser 


n w A 

Pro 


Leu 


Leu 


Leu Ser 


Thr 


Thr 


Glu 


Tro Gin lie Leu 


Pro Cys 




545 








550 








555 




560 




TCC TTC 


ACC 


ACC 


CTG 


CCG GCC 


CTA 


TCC 


ACC 


GGC CTG ATC CAC 


CTC 


CAT 


1728 


Ser Phe 


Thr 


Thr 


Leu 


Pro Ala 


Leu 


Ser 


Thr 


Gly Leu He His 


Leu 


His 








565 








570 




575 






CAG AAC 


ATC 


GTG 


GAC 


GTG CAA 


TAC 


CTG 


TAC 


GGT GTA GGG TCG 


GCG 


GTT 


1776 


Gin Asn 


He 


Vai 


Asp 


Vai Gin 


Tyr 


Leu Tyr 


Gly Vai Gly Ser 


Ala 


Val 






580 






585 




590 









GTC TCC CTT GTC ATC AAA TGG GAG TAT GTC CTG TTG CTC TTC CTT CTC 1824
Val Ser Leu Val Ile Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu
595 600 605

CTG GCA GAC GCG CGC ATC TGC GCC TGC TTA TGG ATG ATG CTG CTG ATA 1872
Leu Ala Asp Ala Arg Ile Cys Ala Cys Leu Trp Met Met Leu Leu Ile
610 615 620

GCT CAA GCT GAG GCC GCC TTA GAG AAC CTG GTG GTC CTC AAT GCG GCG 1920
Ala Gln Ala Glu Ala Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala
625 630 635 640

GCC GTG GCC GGG GCG CAT GGC ACT CTT TCC TTC CTT GTG TTC TTC TGT 1968
Ala Val Ala Gly Ala His Gly Thr Leu Ser Phe Leu Val Phe Phe Cys
645 650 655

GCT GCC TGG TAC ATC AAG GGC AGG CTG GTC CCT GGT GCG GCA TAC GCC 2016
Ala Ala Trp Tyr Ile Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala
660 665 670

TTC TAT GGC GTG TGG CCG CTG CTC CTG CTT CTG CTG GCC TTA CCA CCA 2064
Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro
675 680 685

CGA GCT TAT GCC TAGTAA 2082
Arg Ala Tyr Ala
690



(2) INFORMATION FOR SEQ ID NO: 48:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 692 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
1 5 10 15

Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg
20 25 30

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
35 40 45

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
50 55 60

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val
65 70 75 80

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser Ile Val
85 90 95

Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys Val Pro Cys
100 105 110

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr
115 120 125

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr Ile Arg Arg His
130 135 140

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val
145 150 155 160

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gln Leu Phe Thr Ile
165 170 175

Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys Asn Cys Ser Ile Tyr
180 185 190

Pro Gly His Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
195 200 205

Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gln Leu Leu Arg Ile Pro
210 215 220

Gln Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala
225 230 235 240

Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Val
245 250 255

Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val Ser Gly
260 265 270

Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro
275 280 285

Gly Ser Ala Gln Lys Ile Gln Leu Val Asn Thr Asn Gly Ser Trp His
290 295 300

Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gln Thr Gly Phe
305 310 315 320

Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro
325 330 335

Glu Arg Leu Ala Ser Cys Arg Ser Ile Asp Lys Phe Ala Gln Gly Trp
340 345 350

Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gln Arg Pro Tyr
355 360 365

Cys Trp His Tyr Ala Pro Arg Pro Cys Gly Ile Val Pro Ala Ser Gln
370 375 380

Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly
385 390 395 400

Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp
405 410 415

Ser Asp Val Leu Ile Leu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp
420 425 430

Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly
435 440 445

Gly Pro Pro Cys Asn Ile Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys
450 455 460

Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ala Arg Cys
465 470 475 480

Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr Pro Tyr
485 490 495

Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr Ile Phe Lys Val
500 505 510

Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn
515 520 525

Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu
530 535 540

Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gln Ile Leu Pro Cys
545 550 555 560

Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu Ile His Leu His
565 570 575

Gln Asn Ile Val Asp Val Gln Tyr Leu Tyr Gly Val Gly Ser Ala Val
580 585 590

Val Ser Leu Val Ile Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu
595 600 605

Leu Ala Asp Ala Arg Ile Cys Ala Cys Leu Trp Met Met Leu Leu Ile
610 615 620

Ala Gln Ala Glu Ala Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala
625 630 635 640

Ala Val Ala Gly Ala His Gly Thr Leu Ser Phe Leu Val Phe Phe Cys
645 650 655

Ala Ala Trp Tyr Ile Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala
660 665 670

Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro
675 680 685

Arg Ala Tyr Ala
690

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 






(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2430 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..2427 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



ATG AGC ACG AAT CCT AAA CCT CAA AGA AAA ACC AAA CGT AAC ACC AAC 48
Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1 5 10 15

CGC CGC CCA CAG GAC GTC AAG TTC CCG GGC GGT GGT CAG ATC GTT GGT 96
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

GGA GTT TAC CTG TTG CCG CGC AGG GGC CCC AGG TTG GGT GTG CGC GCG 144
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

ACT AGG AAG ACT TCC GAG CGG TCG CAA CCT CGT GGG AGG CGA CAA CCT 192
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

ATC CCC AAG GCT CGC CGA CCC GAG GGT AGG GCC TGG GCT CAG CCC GGG 240
Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gln Pro Gly
65 70 75 80

TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC ATG GGG TGG GCA GGA TGG 288
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Met Gly Trp Ala Gly Trp
85 90 95

CTC CTG TCA CCC CGC GGC TCT CGG CCT AGT TGG GGC CCT ACA GAC CCC 336
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110

CGG CGT AGG TCG CGT AAT TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC 384
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125

GGC TTC GCC GAC CTC GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA 432
Gly Phe Ala Asp Leu Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140

GGG GGC GCT GCC AGG GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC 480
Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160

GGC GTG AAC TAT GCA ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC 528
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175

TTC CTC TTG GCT TTG CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT 576
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr
180 185 190

GAA GTG CGC AAC GTG TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC 624
Glu Val Arg Asn Val Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser
195 200 205

AAC TCA AGC ATT GTG TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC 672
Asn Ser Ser Ile Val Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro
210 215 220

GGG TGC GTG CCC TGC GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA 720
Gly Cys Val Pro Cys Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val
225 230 235 240

GCG CTC ACC CCC ACG CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG 768
Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr
245 250 255

ACA ATA CGA CGC CAC GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT 816
Thr Ile Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys
260 265 270

TCC GCT ATG TAC GTG GGG GAC CTC TGC GGA TCT GTC TTC CTC GTC TCC 864
Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser
275 280 285

CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC 912
Gln Leu Phe Thr Ile Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys
290 295 300

AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG 960
Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg Met Ala Trp
305 310 315 320

GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG GTA TCG CAG 1008
Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gln
325 330 335

CTG CTC CGG ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT 1056
Leu Leu Arg Ile Pro Gln Ala Val Val Asp Met Val Ala Gly Ala His
340 345 350

TGG GGA GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG 1104
Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp
355 360 365

GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT 1152
Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His
370 375 380

ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG 1200
Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val
385 390 395 400

TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC ACC 1248
Ser Leu Phe Ser Pro Gly Ser Ala Gln Lys Ile Gln Leu Val Asn Thr
405 410 415

AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC TCC 1296
Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser
420 425 430



CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC TAC AAA CAC AAA TTC AAC 1344
Leu Gln Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn
435 440 445

TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC AAG 1392
Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser Ile Asp Lys
450 455 460

TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC ACT GAG CCT AAC AGC TCG 1440
Phe Ala Gln Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser
465 470 475 480

GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT ATT 1488
Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly Ile
485 490 495

GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG TAT TGC TTC ACC CCG AGC 1536
Val Pro Ala Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser
500 505 510

CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT GGT GTC CCC ACG TAT AAC 1584
Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn
515 520 525

TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT CTC AAC AAC ACG CGG CCG 1632
Trp Gly Ala Asn Asp Ser Asp Val Leu Ile Leu Asn Asn Thr Arg Pro
530 535 540

CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG ATG AAT GGC ACT GGG TTC 1680
Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe
545 550 555 560

ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC ATC GGG GGG GCC GGC AAC 1728
Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Ala Gly Asn
565 570 575

AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT CGG AAG CAC CCC GAG GCC 1776
Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala
580 585 590

ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG CTG ACA CCT AGG TGT ATG 1824
Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met
595 600 605

GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC CCC TGC ACT GTC AAC TTC 1872
Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe
610 615 620

ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG GGC GTG GAG CAC AGG TTC 1920
Thr Ile Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe
625 630 635 640

GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT TGT GAC TTG GAG GAC 1968
Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp
645 650 655

AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT ACA ACA GAG TGG 2016
Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp
660 665 670



CAG ATA CTG CCC TGT TCC TTC ACC ACC CTG CCG GCC CTA TCC ACC GGC 2064
Gln Ile Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly
675 680 685

CTG ATC CAC CTC CAT CAG AAC ATC GTG GAC GTG CAA TAC CTG TAC GGT 2112
Leu Ile His Leu His Gln Asn Ile Val Asp Val Gln Tyr Leu Tyr Gly
690 695 700

GTA GGG TCG GCG GTT GTC TCC CTT GTC ATC AAA TGG GAG TAT GTC CTG 2160
Val Gly Ser Ala Val Val Ser Leu Val Ile Lys Trp Glu Tyr Val Leu
705 710 715 720

TTG CTC TTC CTT CTC CTG GCA GAC GCG CGC ATC TGC GCC TGC TTA TGG 2208
Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Ile Cys Ala Cys Leu Trp
725 730 735

ATG ATG CTG CTG ATA GCT CAA GCT GAG GCC GCC TTA GAG AAC CTG GTG 2256
Met Met Leu Leu Ile Ala Gln Ala Glu Ala Ala Leu Glu Asn Leu Val
740 745 750

GTC CTC AAT GCG GCG GCC GTG GCC GGG GCG CAT GGC ACT CTT TCC TTC 2304
Val Leu Asn Ala Ala Ala Val Ala Gly Ala His Gly Thr Leu Ser Phe
755 760 765

CTT GTG TTC TTC TGT GCT GCC TGG TAC ATC AAG GGC AGG CTG GTC CCT 2352
Leu Val Phe Phe Cys Ala Ala Trp Tyr Ile Lys Gly Arg Leu Val Pro
770 775 780

GGT GCG GCA TAC GCC TTC TAT GGC GTG TGG CCG CTG CTC CTG CTT CTG 2400
Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu
785 790 795 800

CTG GCC TTA CCA CCA CGA GCT TAT GCC TAGTAA 2433
Leu Ala Leu Pro Pro Arg Ala Tyr Ala
805 810

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1 5 10 15

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gln Pro Gly
65 70 75 80

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Met Gly Trp Ala Gly Trp
85 90 95

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125

Gly Phe Ala Asp Leu Val Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140

Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr
180 185 190

Glu Val Arg Asn Val Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser
195 200 205

Asn Ser Ser Ile Val Tyr Glu Ala Ala Asp Met Ile Met His Thr Pro
210 215 220

Gly Cys Val Pro Cys Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val
225 230 235 240

Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr
245 250 255

Thr Ile Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys
260 265 270

Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser
275 280 285

Gln Leu Phe Thr Ile Ser Pro Arg Arg His Glu Thr Val Gln Asp Cys
290 295 300

Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg Met Ala Trp
305 310 315 320

Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gln
325 330 335

Leu Leu Arg Ile Pro Gln Ala Val Val Asp Met Val Ala Gly Ala His
340 345 350

Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp
355 360 365

Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His
370 375 380

Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val
385 390 395 400

Ser Leu Phe Ser Pro Gly Ser Ala Gln Lys Ile Gln Leu Val Asn Thr
405 410 415

Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser
420 425 430

Leu Gln Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn
435 440 445

Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser Ile Asp Lys
450 455 460

Phe Ala Gln Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser
465 470 475 480

Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly Ile
485 490 495

Val Pro Ala Ser Gln Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser
500 505 510

Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn
515 520 525

Trp Gly Ala Asn Asp Ser Asp Val Leu Ile Leu Asn Asn Thr Arg Pro
530 535 540

Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe
545 550 555 560

Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Ala Gly Asn
565 570 575

Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala
580 585 590

Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met
595 600 605

Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe
610 615 620

Thr Ile Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe
625 630 635 640

Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp
645 650 655

Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp
660 665 670

Gln Ile Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly
675 680 685

Leu Ile His Leu His Gln Asn Ile Val Asp Val Gln Tyr Leu Tyr Gly
690 695 700

Val Gly Ser Ala Val Val Ser Leu Val Ile Lys Trp Glu Tyr Val Leu
705 710 715 720

Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Ile Cys Ala Cys Leu Trp
725 730 735

Met Met Leu Leu Ile Ala Gln Ala Glu Ala Ala Leu Glu Asn Leu Val
740 745 750

Val Leu Asn Ala Ala Ala Val Ala Gly Ala His Gly Thr Leu Ser Phe
755 760 765

Leu Val Phe Phe Cys Ala Ala Trp Tyr Ile Lys Gly Arg Leu Val Pro
770 775 780

Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu
785 790 795 800

Leu Ala Leu Pro Pro Arg Ala Tyr Ala
805



(2) INFORMATION FOR SEQ ID NO: 51:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 1..17

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51:

Ser Asn Ser Ser Glu Ala Ala Asp Met Ile Met His Thr Pro Gly Cys
1 5 10 15

Val 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 1..22

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Gly Gly Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp
1 5 10 15



Ser Pro Thr Thr Ala Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 
(A) NAME/KEY: Modified-site

(B) LOCATION: 1..37

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53:

Tyr Glu Val Arg Asn Val Ser Gly Ile Tyr His Val Thr Asn Asp Cys
1 5 10 15

Ser Asn Ser Ser Ile Val Tyr Glu Ala Ala Asp Met Ile Met His Thr
20 25 30 

Pro Gly Cys Gly Lys 
35 



(2) INFORMATION FOR SEQ ID NO: 54:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 1..25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Gly Gly Thr Pro Thr Val Ala Thr Arg Asp Gly Lys Leu Pro Ala Thr
1 5 10 15

Gln Leu Arg Arg His Ile Asp Leu Leu
20 25 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 1..25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Gly Gly Thr Pro Thr Leu Ala Ala Arg Asp Ala Ser Val Pro Thr Thr
1 5 10 15

Thr Ile Arg Arg His Val Asp Leu Leu
20 25 



(2) INFORMATION FOR SEQ ID NO: 56:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56:

Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Gln Val Arg Asn
1 5 10 15

Ser Thr Gly Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Gln Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr Asn Asp Cys Pro
1 5 10 15

Asn Ser Ser Ile
20 

(2) INFORMATION FOR SEQ ID NO: 58: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Asn Asp Cys Pro Asn Ser Ser Ile Val Tyr Glu Ala His Asp Ala Ile
1 5 10 15



Leu His Thr Pro 
20 

(2) INFORMATION FOR SEQ ID NO: 59:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:

Ser Asn Ser Ser Ile Val Tyr Glu Ala Ala Asp Met Ile Met His Thr
1 5 10 15

Pro Gly Cys Val 
20 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

His Asp Ala Ile Leu His Thr Pro Gly Val Pro Cys Val Arg Glu Gly
1 5 10 15

Asn Val Ser 

(2) INFORMATION FOR SEQ ID NO: 61: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Cys Val Arg Glu Gly Asn Val Ser Arg Cys Trp Val Ala Met Thr Pro 
1 5 10 15 

Thr Val Ala Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 62:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:

Ala Met Thr Pro Thr Val Ala Thr Arg Asp Gly Lys Leu Pro Ala Thr
1 5 10 15

Gln Leu Arg Arg
20 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Leu Pro Ala Thr Gln Leu Arg Arg His Ile Asp Leu Leu Val Gly Ser
1 5 10 15

Ala Thr Leu Cys 
20 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Leu Val Gly Ser Ala Thr Leu Cys Ser Ala Leu Tyr Val Gly Asp Leu 
1 5 10 15

Cys Gly Ser Val 
20 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65:

Gln Leu Phe Thr Phe Ser Pro Arg Arg His Trp Thr Thr Gln Gly Cys
1 5 10 15

Asn Cys Ser Ile
20 



(2) INFORMATION FOR SEQ ID NO: 66:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Thr Gln Gly Cys Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His
1 5 10 15 

Arg Met Ala Trp 
20 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro
1 5 10 15

Thr Ala Ala Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:

Asn Trp Ser Pro Thr Ala Ala Leu Val Met Ala Gln Leu Leu Arg Ile
1 5 10 15

Pro Gln Ala Ile
20

(2) INFORMATION FOR SEQ ID NO: 69:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Leu Leu Arg Ile Pro Gln Ala Ile Leu Asp Met Ile Ala Gly Ala His
1 5 10 15

Trp Gly Val Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Ala Gly Ala His Trp Gly Val Leu Ala Gly Ile Ala Tyr Phe Ser Met
1 5 10 15

Val Gly Asn Met 
20 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:

Val Val Leu Leu Leu Phe Ala Gly Val Asp Ala Glu Thr Ile Val Ser
1 5 10 15

Gly Gly Gln Ala
20

(2) INFORMATION FOR SEQ ID NO: 72:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Ser Gly Leu Val Ser Leu Phe Thr Pro Gly Ala Lys Gln Asn Ile Gln
1 5 10 15

Leu Ile Asn Thr
20 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Gln Asn Ile Gln Leu Ile Asn Thr Asn Gly Gln Trp His Ile Asn Ser
1 5 10 15

Thr Ala Leu Asn
20 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74:

Leu Asn Cys Asn Glu Ser Leu Asn Thr Gly Trp Trp Leu Ala Gly Leu
1 5 10 15

Ile Tyr Gln His Lys
20

(2) INFORMATION FOR SEQ ID NO: 75:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Ala Gly Leu Ile Tyr Gln His Lys Phe Asn Ser Ser Gly Cys Pro Glu
1 5 10 15

Arg Leu Ala Ser 
20 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Pro Leu Thr Asp Phe Asp
1 5 10 15

Gln Gly Trp Gly
20 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 






(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77:

Thr Asp Phe Asp Gln Gly Trp Gly Pro Ile Ser Tyr Ala Asn Gly Ser
1 5 10 15

Gly Pro Asp Gln
20



(2) INFORMATION FOR SEQ ID NO: 78:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Ala Asn Gly Ser Gly Pro Asp Gln Arg Pro Tyr Cys Trp His Tyr Pro
1 5 10 15

Pro Lys Pro Cys 
20 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Trp His Tyr Pro Pro Lys Pro Cys Gly Ile Val Pro Ala Lys Ser Val
1 5 10 15

Cys Gly Pro Val 
20 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80:

Ala Lys Ser Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val 

1 5 10 15

Val Val Gly Thr
20

(2) INFORMATION FOR SEQ ID NO: 81:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Ser Gly Ala Pro Thr 
1 5 10 15

Tyr Ser Trp Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Gly Ala Pro Thr Tyr Ser Trp Gly Glu Asn Asp Thr Asp Val Phe Val 
1 5 10 15 



Leu Asn Asn Thr 
20 



(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83:

Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys
1 5 10 15

Val Cys Gly Ala
20

(2) INFORMATION FOR SEQ ID NO: 84:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Gly Phe Thr Lys Val Cys Gly Ala Pro Pro Val Cys Ile Gly Gly Ala
1 5 10 15

Gly Asn Asn Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 



Ile Gly Gly Ala Gly Asn Asn Thr Leu His Cys Pro Thr Asp Cys Arg
1 5 10 15 

Lys His Pro 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86:

Thr Asp Cys Phe Arg Lys His Pro Asp Ala Thr Tyr Ser Arg Cys Gly 
1 5 10 15 

Ser Gly Pro Trp
20

(2) INFORMATION FOR SEQ ID NO: 87:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

Ser Arg Cys Gly Ser Gly Pro Trp Ile Thr Pro Arg Cys Leu Val Asp
1 5 10 15

Tyr Pro Tyr Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Cys Leu Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Ile
1 5 10 15

Asn Tyr Thr Ile
20

(2) INFORMATION FOR SEQ ID NO: 89:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Pro Cys Thr Ile Asn Tyr Thr Ile Phe Lys Ile Arg Met Tyr Val Gly
1 5 10 15

Gly Val Glu His
20

(2) INFORMATION FOR SEQ ID NO: 90:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Met Tyr Val Gly Gly Val Glu His Arg Leu Glu Ala Ala Cys Asn Trp 
1 5 10 15

Thr Pro Gly Glu 
20 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Ala Cys Asn Trp Thr Pro Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp
1 5 10 15

Arg Ser Glu Leu
20 



(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92:



Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Thr Thr Thr
1 5 10 15

Gln Trp Gln Val
20

(2) INFORMATION FOR SEQ ID NO: 93:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

Tyr Gln Val Arg Asn Ser Thr Gly Leu
1 5 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
ACGTCCGTAC GTTCGAATTA ATTAATCGA
(2) INFORMATION FOR SEQ ID NO: 95:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95:
'^-:tccggacg tgcactagct cccgtctgtg gtagtggtgg tagtgattat caattaattg 



(2) INFORMATION FOR SEQ ID NO: 96:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
GTTTAACCAC TGCATGATG 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97:
GTCCCATCGA GTGCGGCTAC
(2) INFORMATION FOR SEQ ID NO: 98:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
£GTGACATGG TACATTCCGG ACACTTGGCG CACTTCATAA GCGGA 45

(2) INFORMATION FOR SEQ ID NO: 99:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 42 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO



(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
TGCCTCATAC ACAATGGAGC TCTGGGACGA GTCGTTCGTG AC 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100:
rACCCAGCAG CGGGAGCTCT GTTGCTCCCG AACGCAGGGC AC
(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101:

TigrCGTGGTG GGGACGGAGG CCTGCCTAGC TGCGAGCGTG GG 42 
(2) INFORMATION FOR SEQ ID NO: 102:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 48 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
CGTTATGTGG CCCGGGTAGA TTGAGCACTG GCAGTCCTGC ACCGTCTC 48 
(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
CAGGGCCGTT CTAGGCCTCC ACTGCATCAT CATATCCCAA GC 42
(2) INFORMATION FOR SEQ ID NO: 104:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

CCGGAATGTA CCATGTCACG AACGAC 26 

(2) INFORMATION FOR SEQ ID NO: 105:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105:

GCTCCATTGT GTATGAGGCA GCGG 24 
(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
GAGCTCCCGC TGCTGGGTAG CGC 23 
(2) INFORMATION FOR SEQ ID NO: 107: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
CCTCCGTCCC CACCACGACA ATACG 25 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108:

CTACCCGGGC CACATAACGG GTCACCG 27 
(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

GGAGGCCTAC AACGGCCCTG GTGG 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

TTCTATCGAT TAAATAGAAT TC

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111:



(^ATACGCT CACAGCCGAT CCC 23